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String diagrams are a powerful tool for reasoning about composite structures in symmetric monoidal 
categories. By representing string diagrams as graphs, equational reasoning can be done automati¬ 
cally by double-pushout rewriting. 1-graphs give us the means of expressing and proving properties 
about whole families of these graphs simultaneously. While !-graphs provide elegant proofs of sur¬ 
prisingly powerful theorems, little is known about the formal properties of the graph languages they 
define. This paper takes the first step in characterising these languages by showing that an important 
subclass of '-graphs—those whose repeated structures only overlap trivially—can be encoded using 
a (context-free) vertex replacement grammar. 


1 Introduction 

String diagrams are essentially directed graphs, but instead of edges they have a more flexible notion 
of wires. Wires can be left open at one or both ends to form inputs and outputs to the diagram, or 
can be connected to themselves to form a circle. They have a formal semantics given by monoidal 
category theory Q, and have found applications in many fields. For example: in models of concurrency, 
they give an elegant presentation of Petri nets with boundary lfT5l . in computational linguistics, they 
are used to compute compositional semantics for sentences 01, in control theory, they represent signal- 
flow diagrams JUE1, and in theoretical physics, they provide the formal language of categorical quantum 
mechanics 10. A string graph encodes a string diagram as a typed, directed graph, by replacing the wires 
with chains of edges containing special dummy vertices called wire-vertices (see Fig.|T|). By contrast, the 
“real” vertices, labelled here f,g and li are called node-vertices. String graphs—originally introduced 
under the name “open graphs” in J7]—have the advantage that they are purely combinatoric (as opposed 
to geometric) objects. Equational reasoning on string diagrams is done by replacing sub-diagrams. The 



Figure 1: A string diagram, and its encoding as a string graph 
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presence of dummy vertices in string graphs allows this to be done with double-pushout (DPO) graph 
rewriting. For example, in: 

£ 


f 

the LHS (shown in green above) is replaced by the RHS (blue), using the common boundary (red). This 
type of rewriting for string graphs is implemented in the graphical proof assistant Quantomatic lfl2l . 




Often, one wants to reason not just about single string graphs, but entire families of them. However, 
informal notions of graphs with repetition are ill-suited for automated tools. To address this problem, !- 
graphs (pronounced “bang-graphs”) were introduced in J6) and formalised for string graphs in ifTO . The 
idea behind 1-graphs is that certain marked subgraphs (along with their adjacent edges) can be repeated 
any number of times, in a manner somewhat analogous to the Kleene star. These marked subgraphs are 
called !-boxes. For example: 



The !-graph in the semantic brackets represents the depicted set of string graphs. More precisely, an 
instance of a !-graph is obtained by repeatedly applying of the two operations EXPAND and KILL on 
the !-graph: 




( 1 . 2 ) 


until all !-boxes have been eliminated. The set of concrete graphs (i.e. those not containing any !-boxes) 
obtainable from a !-graph via these operations is called the language of the !-graph. 


We call the set of languages expressible by 1-graphs BG. In this paper, we will compare the expres¬ 
siveness of this language to context-free vertex replacement grammars. We will focus on confluent 
neighbourhood-controlled embedding grammars, with directions and edge-labels, i.e. C-edNCE gram¬ 
mars lfl4l . 


In arbitrary 1-graphs, the 1-boxes are allowed to nest inside of each other, but also overlap. The latter can 
generate patently non-context-free behaviour. Thus, it is natural to consider a restricted set of languages, 
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1-Graphs with Trivial Overlap are Context-Free 


LIN-edNCE BG 



Figure 2: Relationship between LIN-edNCE, BG and BGTO 


BGTO of .’-graphs with trivial overlap. In this paper, we will give an algorithm for producing a C- 
edNCE grammar that is furthermore linear (LIN-edNCE) which reproduces the language of any !-graph 
with trivial overlap. We therefore show the relationship between graph languages illustrated in Fig. [2] 

After introducing some preliminaries on string graphs, !-graphs, and C-edNCE grammars in Section [2] 
we will give an encoding of 1-graphs into LIN-edNCE grammars in Section [3] In that section, we also 
demonstrate that LIN-edNCE contains languages not in BG (and hence is strictly larger than BGTO). 
Finally, we give one conjecture and discuss the prospects for reasoning with context-free string graph 
grammars in a manner analogous to the 1-graph case in Section[4] 

2 Preliminaries 

Definition 2.1 (Graph fll4l ). A graph over an alphabet of node labels £ and an alphabet of edge labels T 
is a tuple H = (V,E,X), where V is a finite set of nodes, E C {(v, y, w)|v,w E V, v / w, y E T} is the set 
of edges and X : V —> £ is the node labelling function. The set of all graphs with labels £,T is denoted 
GR Z . r . 

Remark 2.2. Note, that this definition of graphs does not allow for self-loops. This is a standard (and 
convenient) assumption in the C-edNCE literature. By using this notion of graphs, we do not lose any 
expressivity for string graphs, because any string graph with a self-loop is wire-homeomorphic (see 
Definition |2.6| ) to a string graph with no self-loops. 

Definition 2.3 (String Graph). A string graph is a graph labelled by the set {A,W}, where vertices 
labelled N are called node-vertices and vertices labelled W are called wire-vertices, and the following 
conditions hold: (1) there are no edges directly connecting two node-vertices, (2) the in-degree of every 
wire-vertex is at most one and (3) the out-degree of every wire-vertex is at most one. 

We will depict wire vertices as small black dots and node-vertices as larger white circles, as we have 
already done in Section[T] For simplicity, we assume there is only one type of node-vertex, but we could 
easily capture string graphs like |T] by taking the labels to be {Nj,N g ,Nh,W}, for example. 

We also identify those wire-vertices which form the boundary of a string graph: 

Definition 2.4 (Inputs, Outputs and Boundary Vertices). A wire-vertex of a string graph G is called an 
input if it has no incoming edges. A wire-vertex with no outging edges is called an output. We denote 
with In{G) and Out(G) the string graphs with no edges whose vertices are respectively the inputs and 
outputs of G. The boundary of G is Bound(G ) := In(G) U Out{G). 
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Just as node-vertices represent the boxes in a string diagram, chains of wire-vertices represent the wires. 

Definition 2.5 (Wires). A connected chain of vertices where each endpoint is either a boundary or a 
node-vertex, and all other vertices are wire-vertices is called a closed wire. A closed wire minus its 
endpoints is called the interior of a closed wire. A cycle consisting only of wire-vertices is called a 
circle. 

Often, it is more natural to consider an equivalence class of string graphs, up to wire homeomorphism. 

Definition 2.6 (Wire-homeomorphic string graphs). Two string graphs G and G' are called wire-homeo- 
morphic, written G ~ G' i f G' can be obtained from G by either merging two adjacent wire-vertices (left) 
or by splitting a wire-vertex into two adjacent wire-vertices (right) any number of times: 

*->•->•-> * i —y *- >• -> * *->•-> * i —y *- >* ->•-> * 


That is, two graphs are wire-homeomorphic if the only dif¬ 
ference between them is the number of wire-vertices used to 
represent a wire in the diagram (see Fig{3]). 

Next, we provide definitions for !-graphs and related notions. 

First, we need to identify which subgraphs are legal to put in 
!-boxes. 

Definition 2.7 (Open Subgraph). A subgraph O of a string Figure 3: Wire homeomorphism 
graph H is said to be open if it is a full subgraph and further¬ 
more In(H\0) C ln(H) and Out(H\0 ) C Out(H). 

Definition 2.8 (!-graph). A .'-graph H is a string graph, a partially ordered set !(//) of !-boxes, and an 
open subgraph B(b ) C H for every b €\{H) such that b < b' B{b ) C B{b'). 



The notion of openness ensures that we never put only part of a wire inside a !-box, which would cause 
wires to “split” in the middle as we expand the !-box, which violates the arity conditions for wire-vertices 
from Definition 12. 3 1 

Iffi 2 < b\, this means that bi is nested inside of b \, which we indicate by drawing a line connecting their 
corners. This is to distinguish from the case where b\ and bi merely overlap. 



Definition 2.9 (Concrete Graph). A !-graph with no !-vertices is called a concrete graph or simply a 
string graph. 


A !-graph represents an infinite set of concrete string graphs, each of which is obtained after a finite 
number of applications of the EXPAND and KILL operations presented below. 
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Definition 2.10 (!-box Operations). The two primitive !-box operations are as follows: 

• KILL,(G) = G'\B{b) 

* EXPAND, (G) = G U KILLb[G) G ' 

where G' has the same underlying string graph as G, but with b and all of its children removed from ! (G). 
The expression for EXPAND is the disjoint union of G and G' , with the vertices, edges, and !-boxes in 
the common subgraph KILL,(G) identified. 

As seen in ( |1.2[ ), KILL, removes a !-box b and its contents, whereas EXPAND, inserts a new copy of 
the contents of b. See iflOl for a more detailed description of these operations. 

Definition 2.11 (!-graph Language). The language of a !-graph G is the set of all concrete string graphs 
obtained by applying a sequence of !-box operations on G. 

Note that overlapping !-boxes are not explicitly ruled out. However, they can create some odd non-local 
behaviour. So, naively, one might want to rule out overlap entirely, but it turns out that certain instances 
of overlap are more innocent than others. 

Definition 2.12 (Overlap and Trivial Overlap). Given a pair of non-nested !-boxes b\ and 1)2, we say that 
b\ and hi are overlapping if B(b\ ) OBibj) 9 0. b\ and bn overlap trivially if B(b\ ) FBibj) consists of 
only the interior of zero or more closed wires, where one endpoint is a node-vertex only in B(b\) and the 
other is a node-vertex only in B{bi). 

In particular, B{b\) Hi Blbn) can be empty, so trivial overlap 
generalises no-overlap. In Fig.[4]Zq and hi overlap trivially, 
since they only overlap on the interior of a wire connect¬ 
ing node-vertices in the two boxes, bn, and 1)4 overlap non- 
trivially, since they overlap on a node-vertex. The most sub¬ 
tle case is b 5 and /;,, which overlap non-trivially because the 
shared wire-vertices are not part of a wire whose endpoints 
are in distinct !-boxes. Of course, all other pairs of !-boxes 
overlap trivially, as their intersection is just the empty set. 

Definition 2.13 (BGTO). A !-graph where any two non¬ 
nested !-boxes overlap trivially is called a .'-graph with trivial overlap. The set of all languages induced 
by these !-graphs is called BGTO. 

Next, we introduce C-edNCE graph grammars. Graph grammars are a generalization of context-free 
grammars for strings. A graph grammar consists of a finite collection of productions which specify 
instructions on how to generate a family of graphs. Like their context-free string counterparts, context- 
free graph grammars have better structural, decidability and complexity properties compared to other 
more expressive mechanisms for graph transformation. We use the same definitions and conventions for 
graph grammars as presented in lfT4l . which also describes C-edNCE grammars and their properties in 
much greater detail. 

C-edNCE grammars are built using graphs with embedding. These provide the information needed to 
replace a non-terminal node with a new graph, and update connections accordingly. 
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Figure 4: Trivial vs non-trivial overlap 
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( 1 ) ( 2 ) ( 3 ) 




Figure 5: (1) Graph with embedding ( H,Ch) (2) Graph with embedding (D.C/,) (3) Result of substitution 
(H.C/i)[v/(D.Cj))\ where v is the vertex with non-terminal label in H. 


Definition 2.14 (Graph with embedding lfT4iO . A Graph with embedding over labels Z.F is a pair ( H,C ), 
where H is a graph over Z. F and C C Z x T x F x V// x {in, out}. C is called a connection relation and its 
elements are called connection instructions. The set of all graphs with embedding over Z. F is denoted 
by GREzp. 

Graph grammars operate by substituting a graph (with embedding) for a non-terminal node of another 
graph. Connection instructions are used to introduce edges connected to the new graph based on edges 
connected to the non-terminal. A connection instruction as (<7,f3/y,x,d) says to add an edge labelled y 
connected to the vertex x in the new graph, for every /J -labelled edge connecting a cr-labelled vertex to 
the non-terminal, d then indicates whether this rule applies to in-edges or out-edges of the non-terminal. 
More formally: 

Definition 2.15 (Graph Substitution |[74j). Let (. H,Ch),(D,Cd ) E GREy\ be two graphs with embed¬ 
ding, where H and D are disjoint. Let v E Vh be a node of H. The substitution of ( D,Cd ) for v in ( H. Ch ) 
is denoted by (H,Ch)[v/(D,Cd)\ and is given by the graph with embedding whose components are: 

V = (V[[ — {v}) U Vo 

E = {(x,y,y) eE H \x^v,y^v}UE D 

U{(w, y,x) | 3)3 eF: (wj 3, v) € E h ,(Ah(w),/J /y,x,in) E C D } 

U {(x,y,w) | 3/3 E T: (v,j3,w) E E H ,(^ H (w),P/Y,x,out) E C D } 

= I'M*) if-v e (V// — {v}) 

[Az)(x) if xeV D 

C = {(a,P/y,x,d) eC H \x^v} 

\J{{a,fi/8,x,d) | 3yer : (a,p/y,v,d) E C H , {o,y/8,x,d) eC D } 


Thankfully, graphs with embedding allow for a simple graphical presentation (see Fig. [5]). We draw 
graphs as in the previous sections, but with the additional convention that nodes with non-terminal labels 
are drawn as boxes instead of circles. A single connection instruction is drawn as a pair of edges crossing 
the outer frame. The edge and node types outside of the frame specify neighbouring node(s) which are 
connected to the non-terminal in the host graph and the edge(s) inside the frame indicate what new edges 
will be constructed to and from those nodes. 
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Next, we define the concept of an edNCE Graph Grammar. edNCE is an abbreviation for Neighbourhood 
Controlled Embedding for directed graphs with dynamic edge relabeling. 

Definition 2.16 (edNCE Graph Grammar 1141 ). An edNCE Graph Grammar is given by a tuple G = 
(£,A,r ,D.,P,S), where £ is the alphabet of node labels, A C £ is the alphabet of terminal node labels, F 
is the alphabet of edge labels, G C F is the alphabet of final edge labels, P is a finite set of productions 
and S E £ — A is the initial nonterminal. Productions are of the form X —>■ (D,C), where X E £ — A is a 
non-terminal node and (D,C) E GREy \ is a graph with embedding. 

We can graphically depict an edNCE grammar by depicting all of its productions. A production is drawn 
exactly like a graph with embedding with the addition that we depict its associated non-terminal in the 
top-left part of the frame. 

Example 2.17. The following grammar generates the set of all chains of node vertices with an input and 
no outputs: 




x 


Definition 2.18 (Derivation Step Ifl4l ). For a graph grammar G = CL.A.F.Q..P.S) and graphs (with 
embedding) H,H' E GREz,r consider a vertex v E Vh and an isomorphic copy p : X —> ( D,C) of some 
production p’ E P. We say H => vp H’ is a derivation step if A#(v) = X and H’ = H[v/ (D,C)]. If v and p 
are clear from the context, then we write H => H'. A sequence of derivation steps Hq =^ Vl>Pl Ho =^ V2 ,P 2 
••• => Vn , Pn H n is called a derivation. A derivation is called creative if Ho and rhs{p ,-) are all mutually 
disjoint. We write H =>* H 1 if there exists a creative derivation from H to H’. 

Example 2.19. A derivation in the above grammar of the string graph with three node vertices: 


m s i V 1 —S m s l V m Sm Sm s\ Y m S a 


where again we color the newly established edges in red. 

Definition 2.20 (Starting Graph lfl4l ). We say that sn(S,z) E GRz,r is a starting graph if it has only one 
node given by z, its label is S, the graph has no edges and no connection instructions. 

Definition 2.21 (Graph Grammar Language Ifl4ll ). The graph language induced by a graph grammar 
G = (£, A,r,G,P,S) is given by L{G ) = {[//] | H E GR\si and sn(S,z ) =>* H for some z}, where [H] 
denotes the equivalence class of all graphs which are isomoiphic to H. 

Definition 2.22 (Confluence Ifl4l ). We say that a graph grammar G = CL.A.F.O.P.S) is confluent if the 
following holds for every graph H E GREz, r- if H =N, 1)Pl H i =^„ 2 . /)2 H n and H =^„ 2iP2 H 2 =N, liPl H 2 i are 
creative derivations of G with u\ u 2 , then H\ 2 = H 2 \. 


There are simple and easily decidable conditions for determining if an edNCE grammar is confluent. The 
class of confluent edNCE grammars is denoted C-edNCE. 

Definition 2.23 (LIN-edNCE grammar lfl4ll ). An edNCE grammar G is linear, or a LIN-edNCE gram¬ 
mar, if for every production X -p- (D,C), D has at most one nonterminal node. 

At any given point, only one non-terminal can be replaced, so LIN-edNCE grammars are automatically 
confluent. Hence, they are a subclass of C-edNCE grammars. 
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3 !-graphs vs C-edNCE grammars 

In this section we outline the relationship between !-graph languages and context-free languages de¬ 
scribed by C-edNCE grammars and some of their respective subclasses. In subsection |3. 1 [ we show that 
the language induced by any !-graph H with no overlapping !-boxes can be described by a LIN-edNCE 
grammar, which can moreover be constructed effectively from H. In subsection |3.2| we first show that 
there exists a !-graph with trivial overlap of !-boxes whose language cannot be directly represented using 
any C-edNCE grammar. Next, we show that the language of any !-graph H with trivial overlap between 
!-boxes can be represented by a LIN-edNCE grammar up to wire-encoding and the grammar can be ef¬ 
fectively constructed from H as well. In subsection |3.3| we show that the classes of languages induced 
by (unrestricted) I-graphs and C-edNCE grammars respectively are incomparable. 

3.1 !-graphs without overlap 

The main result of this subsection is a theorem stating that the language induced by any !-graph with no 
overlapping of !-boxes can be directly represented by a LIN-edNCE grammar, which can moreover be 
generated effectively. 

Before we present the main results in this subsection and the next one, we prove a series of lemmas 
which are used in the proof of the two main theorems. Each lemma describes how to build bigger LIN- 
edNCE grammars out of smaller ones which in turn describe the language induced by certain subgraphs 
of a given !-graph. All of our constructions are effective in the sense that they can be performed by a 
computer. 

We introduce some conventions that are used throughout our proofs. Given a !-graph H, its vertices will 
be {vi, V 2 -, v„}. To these vertices, we will associate non-final edge labels {cti, «2 ,•••,««} which will be 
used in the productions of our grammars. Informally, an edge labeled with a,- will have as its source or 
target either the original vertex v, or one of its copies. This is used in some productions of our grammars, 
so that we can easily refer to all copies of such a vertex at once and connect them to other vertices. 

Throughout the proof, we will use grammars satisfying the following conditions: 

1. The grammar is in LIN-edNCE, thus every production has at most one non-terminal node in it 

2. There is a single final production which is the empty graph 

3. Every sentential form with a non-terminal node is such that all terminal vertices are connected 
with an edge to the non-terminal node (in both directions) and every such edge has as label some 
a i 

4. Every production except the final one has connection instructions which specify that both incoming 
and outgoing edges of type a, are connected to the non-terminal. Graphically, we will depict that 
using bidirectional edges as a shorthand notation for an edge in each direction. 

5. For every /, there is at most one terminal vertex in the productions of a grammar which is incident 
to an edge with label a, 

For convenience, we will refer to a grammar satisfying 1-5 as being in !-linear form. 

Lemma 3.1. Given a concrete string graph H. there exists a LIN-edNCE grammar '4 which generates 
the language {H}. Moreover, this grammar can be effectively generated and is in !-linear form. 
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Proof. This is done by the following grammar: 


s 



F 


where the set of vertices of H is Vh = {vi,V 2 , and each edge with label a, has as source or target 

Vi and the non-terminal F. □ 

Lemma 3.2. Given a .'-graph FI and a '.-linearform grammar which generates the same language as 
H, there exists a grammar If' which generates the same language as the following .'-graph: 



Moreover, ( S' can be effectively generated and is in .'-linear form. 

Proof. Let’s assume that S is the starting production of F is the final production of ‘S and that S' and 
F' are not productions of . 

First, we modify the production F to be the following: 


F 



Finally, to the productions of we add the following productions, where S' is the starting one: 


s' 



s' 



F' 


A derivation S' =$■ S =>■ • • • F creates a concrete string graph from the language of H, so it is simulating 
a single EXPAND operation applied to the top-level !-box, together with a concrete instantiation of the 
!-boxes in H. By construction, we can iterate this, thus allowing us to generate multiple disjoint concrete 
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graphs, all of which are in the language of H. A derivation S' => F' simulates the final KILL operation 
applied to the top-level !-box. □ 

Lemma 3.3. Given disjoint!-graphs H, K and grammars Fi \, f /i 2 which generate the same languages as H 
and K respectively, then there exists a grammar C S' which generates the same language as the following 
!-graph: 


Moreover, ( S' can be effectively generated and is in !-linear form. 


Proof Let the vertices of FI be {vj, V 2 ,..., v k } and let the vertices of K be {vy + |, v k+2 , ■■■, v„}. Also, let 
Si and Fj be the starting and final productions respectively of %. First, we modify each production X of 
( F \, except F\, by adding connection instructions for edge labels a , k+ \,..., 0C n in the following way (left): 


x 




where the new additions are colored in red. This doesn’t change the language of Sfi and is done so that 
we can put the grammar in the required form. Similiarly, modify all productions of ‘S- \, except F 2 , by 
adding to their connection instructions the missing edge labels cq, a 2 , ■. ., cq. Finally, modify F\ to be 
the production depicted above (right), so that we can chain together the two grammars. 

The required grammar Sf' has as its productions the modified productions of ( 4\ and % with starting 
production Si. A derivation Si =>■ ••• =>■ F\ creates a concrete graph from the language of FI and a 
derivation S 2 =A • • • =A F 2 creates a graph from the language of K. By chaining the two grammars, we 
simply generate two disjoint concrete graphs, one from the language of H and one from the language of 
K, as required. □ 


Lemma 3.4. Given !-graph H, where FI contains a l-box b and given a !-linear form grammar ( F which 
generates the same languages as H, there exist grammars C S’ and < S" which generates the same languages 
as: 



and 







respectively, where in both cases, the newly depicted edge (colored in red) is incident to the !-box b 
in H and the edge is also incident to a node-vertex in H which is not in any !-boxes. Moreover, these 
grammars can be effectively generated and are in !-linear form. 
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Proof. In both cases, for the newly depicted edge, identify the wire-vertex as v, and the node-vertex as 
vj. To get ( /P, identify the unique production X of Sf, such that X contains a node-vertex incident to an 
edge with non-final label ctj. Then add to its connection instructions a new edge in the following way 
(left): 



where the red-colored edge is the new addition. To get the grammar C S", follow a similar same procedure 
(shown on the right above). This modification has the effect that we connect all copies of the wire vertex 
to the single node vertex (in the appropriate direction), which is the only change required compared to 
the concrete graphs of H. □ 

Theorem 3.5. Given a .'-graph H such that it doesn ’t have any overlapping !-boxes, there exists a L1N- 
edNCE grammar G which generates the same language as H. Moreover, this grammar can be effectively 
generated and is in .'-linear form. 


Proof. We present a proof by induction on the number of !-boxes of H. 

For the base case, if PI has no !-boxes, then lemma [3TT| completes the proof. 

For the step case, pick any top-level !-box and let’s consider the full subgraph of H it induces. Call this 
subgraph K. Any vertex v of H\K which is adjacent to K must be a node-vertex, because otherwise this 
would violate the openness condition of !-boxes. Let w E K be a wire-vertex that v is adjacent to and let 
e be the edge connecting v to w. 


K 


H\K 


w e 

>- 



If v is in some !-box b, then the openness condition of !-boxes implies that w must also be in b. However, 
we have assumed that H does not contain overlapping !-boxes, so this is not possible and thus v is not in 
any !-boxes. Therefore, we can use lemma [T4| to reduce the problem to showing that we can effectively 
generate a grammar for H\e. Similarly, by applying the same lemma multiple times, we can reduce the 
problem to showing that we can effectively generate a grammar for the !-graph consisting of the disjoint 
Lgraphs K and H\K. Applying lemma 3.3 and the induction hypothesis then reduces the problem to 
showing the theorem for K. Finally, we can apply lemma 3.2 to K and then the induction hypothesis to 
complete the proof. □ 
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3.2 !-graphs with trivial overlap 

We begin by providing an example of a !-graph with trivial overlap of !-boxes 
which cannot be directly represented using any C-edNCE grammar. Then, we 
prove the second main theorem of this work which shows that the language of 
any !-graph with trivial overlap of !-boxes can be represented by a LIN-edNCE 
grammar up to wire-encoding. The construction of the grammar is also effective. 

Proposition 3.6. The generative power of C-edNCE grammars and Hyperedge 

Replacement grammars on string graphs is the same. . 

Figure 6 : '-graph 

with no direct en- 

Proof The graph ^ 3,3 is not a subgraph of any string graph. Then, the proposition coding in C-edNCE 
follows immediately from the main result in 0 . □ 

Proposition 3.7. The language induced by the !-graph with trivial overlap of !- 
boxes shown in Fig. [ 6 ] cannot be directly described using any C-edNCE grammar. 



Proof. The language of the above !-graph is given by: 



In other words, it’s the set of complete bipartite graphs K m n where each node-vertex is connected to other 
node vertices through a single wire vertex. 


Let us assume for contradiction that there exists a C-edNCE grammar which generates L. From propo¬ 
sition |3.6[ it follows that L can be described using a hyperedge replacement grammar. Then, a simple 
application of the pumping lemma for hyperedge-replacement languages 0 yields a contradiction. □ 


Note, however, that the graph language of complete bipartite graphs K mn (without wire vertices in- 
between) can be generated by a LIN-edNCE grammar. Moreover, this language is wire-homeomorphic 
to the language in the above proposition. However, removing the wire vertices from our C-edNCE 
language brings another complication - we need to account for parallel edges between node vertices by 
introducing additional labels on the parallel edges, because in most of the literature, C-edNCE grammars 
do not allow for parallel edges of the same type. 

Using these two ideas, we define the notion of wire-encoding and then show the main result of this 
subsection. 

Definition 3.8 (Wire-encoding). We say that two graphs H and H' are equal up to wire-encoding, if we 
can get one from the other by replacing every edge with special label by a closed wire with endpoints 
the source and target of the original edge. 
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We also say that two graph languages L and L' are equal up to wire-encoding, if there exists a bijection 
/ : L —> l!, s.t. for every H G L, H and f(H) are equal up to wire-encoding. 

Lemma 3.9. Given !-graph FI which contains non-nested !-boxes h\ and hi and given a /-linear form 
grammar ( Ft which generates the same languages as H, there exists a grammar C S' which generates the 
same language, up to wire-encoding, as the following /-graph: 


* o 


■ 

; 'vj 




where the new additions are coloured in red and the newly depicted wire-vertex is in both h\ and hi. 
Moreover, this grammar can be effectively generated and is in !-linear form. 


Proof For the newly depicted edges, identify the source node-vertex as v,- and the target node-vertex as 
Vj. To get the desired grammar ( /P, identify the unique production X of , such that X contains a node 
incident to an edge with non-final label OLj. Then, add to its connection instructions a new edge with 
(final) label fip. 


x 



where the new addition is coloured in red. Note, that with this construction, we are not creating the 
depicted wire-vertex, nor any of its copies. We are connecting all copies of the node-vertex v; to all 
copies of the node-vertex vj directly with edges labelled with ff ,. 


□ 

Theorem 3.10. Given a /-graph H such that the only overlap between /-boxes in FI is trivial, then 
there exists a LIN-edNCE grammar G which generates the same language as H, up to wire-encoding. 
Moreover, this grammar can be effectively generated and is in !-linear form. 


Proof. The proof is the same as for the previous theorem, except that we have to consider an additional 
case, namely, when the node-vertex v is in a !-box. 


b 



w 

_vF :!. 
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In this case, the wire-vertex w is in both !-boxes and we can apply lemma 3.9 to reduce the problem to 
showing that H\w can be handled. If we set W to be the set of all wire vertices which overlap with b and 
another !-box, then applying the same lemma multiple times reduces the problem to showing that H\W 
can be handled. However, we have assumed that only trivial overlap between !-boxes exists in H and 
therefore in H\W there is no overlap between b and any other !-boxes. Then, the proof can be finished 
using the same arguments as in the previous theorem. □ 


3.3 The power of context-free languages 

In this subsection, we will show that there exists a LIN-edNCE language which cannot be induced by 
any (unrestricted) !-graph. We start by providing some definitions. 

Definition 3.11 (Maximum distance). For a graph G and vertices v,u£G, the distance between u and v 
is the length of the shortest path connecting u and v. If there is no path between u and v then we say that 
the distance is -1. The distance between a vertex and itself is 0. The maximum distance for a graph G is 
the largest distance among all pairs of vertices. 

Definition 3.12 (Bounded maximum distance). For a set of graphs <£ = {G\ , G 2 , G 3 ,...}, we say that 
is of bounded maximum distance if there exists an integer n e N, such that the maximum distance for 
every graph in <£ is smaller than n. If such an 11 does not exist, then we say that Sf is of unbounded 
maximum distance. 

Proposition 3.13. The language induced by any .'-graph is of bounded maximum distance. 


Proof. Consider a !-box B CG. Applying a KIFF operation to B cannot increase the maximum distance. 
Applying an EXPAND operation once could potentially increase it, however, applying an EXPAND more 
than once will not increase it any further. Thus, regardless of how many times an EXPAND operation is 
applied to B the maximum distance can only increase by a fixed amount. Also, because of the symmetric 
properties of the EXPAND map, any nested !-boxes within B or any overlapping !-boxes can increase the 
maximum distance with a fixed amount as well regardless of how many EXPAND operations are applied 
to any of them. 

By combining the above observation with the fact that G has finitely many !-boxes we can conclude that 
the set of graphs induced by G is of bounded maximum distance. □ 

In the next proposition, we show that even severely restricted graph grammars can generate languages 
which are not induced by any !-graph, thereby establishing that C-edNCE BG. 

Proposition 3.14. The language of the LIN-edNCE grammar of Example \ 2.1 7\ 


S XX 



O V 










is not induced by any .'-graph, even up to wire-encoding. 
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Proof. The language generated by this grammar is of unbounded maximum distance and thus it cannot 
be generated by any !-graph. □ 


4 Conclusion and future work 

We have shown that the language of a !-graph with trivial 
overlap can always be encoded using a context-free gram¬ 
mar. BGTO languages are in fact a natural restriction to BG, 
and have already arisen in alternative !-box formalisations, 
such as the (non-commutative) !-tensors described in fTO. 

Thus, gaining a better understanding of BGTO languages is 
valuable in its own right. The presence of non-trivial over¬ 
lap tends to cause highly non-local effects when expanding 
!-boxes, so it could be the case that the property of being 
context-free actually characterises trivial overlap. In other 
worc j s - Figure 7: Conjectured relationship be¬ 

tween C-edNCE and BG 

Conjecture 4.1. The language induced by any .'-graph 
which contains !-boxes whose overlap is non-trivial cannot 
be described by a C-edNCE grammar, even up to wire-encoding. 

If the conjecture holds, then the Venn Diagram in Section [T] would simplify to the one in fig. [7} which 
lends credence to the notion that BGTO languages are the string graph analogue to regular languages. 

Quite aside from classification issues, the next step for context-free string graphs grammars is to develop 
the tools for working with them. For example, since they are built on top of string graphs, 1-graphs can 
be plugged together along inputs and outputs to get new !-graphs, and more importantly, they can be 
used to define .'-graph rewrite rules: 




Just as 1-graphs represent families of string graphs, 1-graph rewrite rules represent families of string graph 
rewrite rules. Furthermore, new 1-graph rules can be introduced from concrete ones using a technique 
called !-box induction Ifl3ll . The natural next step in this program then is to see if composition, rewriting, 
and a notion of graphical induction cany through to the context-free case. 
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